arXiv:1509.07150v4 [math.PR] 12 Oct 2015 


Generalized Mittag Leffler distributions 
arising as limits in preferential 
attachment models 

Lancelot F. JameP^ 

^Lancelot F. James 

The Hong Kong University of Science and Technology, 

Department of Information Systems, Business Statistics and Operations Management, 
Clear Water Bay, Kowloon, Hong Kong, 
e-mail: lancelotOust .hk 

Abstract: For 0 < o < 1, and 9 > —a, let (5'” 0 _|_^){r>o} denote an in- 
creasing(decreasing) sequence of variables forming a time inhomogeneous 
Markov chain whose marginal distributions are equivalent to generalized 
Mittag Leffler distributions. We exploit the property that such a sequence 
may be connected with the two parameter (q, 0) family of Poisson Dirichlet 
distributions with law PD(q;, 0). We demonstrate that the sequences serve 
as limits in certain types of preferential attachment models. As one illustra¬ 
tive application, we describe the explicit joint limiting distribution of scaled 
degree sequences arising under a class of linear weighted preferential attach¬ 
ment models as treated in Mori [3^ . with weight /3 > — 1. When ^ = 0 this 
corresponds to the Barbasi-Albert preferential attachment model. We are in 
fact primarily interested in distributional properties of a,nd 

related quantities arising in more intricate exchangeable sampling mech¬ 
anisms, with direct links to nested mass partitions governed by PD(a, 0). 

We construct sequences of nested (a, 9) Chinese restaurant partitions of [n]. 

From this, we identify and analyze relevant quantities that may be thought 
of as mimics for vectors of degree sequences, or differences in tree lengths. 

We also describe connections to a wide class of continuous time coales- 
cent processes that can be seen as a variation of stochastic flows of bridges 
related to generalized Fleming-Viot models. Under a change of measure 
our results suggest the possibilities for identification of limiting distribu¬ 
tions related to consistent families of nested Gibbs partitions of [n] that 
would otherwise be difficult by methods using moments or Laplace trans¬ 
forms. In this regard, we focus on special simplifications obtained in the 
case of a = 1/2. That is to say, limits derived from a PD(l/2|t) distribu¬ 
tion. Throughout we present some distributional results that are relevant 
to various settings. We close by describing nested schemes varying in (q, r). 
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1. Introduction 


For each 0 < a < 1, let S'a denote a random variable whose law coincides with a 
positive stable random variable with index a specified by its Laplace transform 


E[e 


— UjSa 


= e 


and density denoted as fait)- Now define the variables Sa,e 


for each 6 > —a, as having a density, denoted by fa, 0 : formed by polynomially 
tilting a stable density as follows 

fa,eit) = Cafit~^ fait) ( 1 - 1 ) 

where Ca,e ■= r(0 + l)/r(0/a + 1), and satisfies for <5 + 0 > —a 

, _ r(0 + i) Eiy-w), _ r(ga + i) r(fl + i) 

+ J-r(0 + 5 + i)r(0/a + i)- 


The variable 


= Q is often referred to as having a Mittag-Leffler distribu¬ 


tion, hence it is natural to consider S~'^ as generalized Mittag-Leffler variables. 
In terms of combinatorial objects, versions of such variables arise as limits in the 
two parameter Poisson Dirichlet framework of as follows: Let (Pfc){fe>i} be 
the collection of ranked probability masses summing to 1, whose law, denoted 
as PD(a,0), follows a Poisson-Dirichlet distribution with parameters (a,0) as 
described in Pitman and Yor [d^ and Pitman M\- From those works, it fol¬ 
lows that letting denote the number of blocks in a PD(a,0) partition of 
[n] = {1,2,..., n}, that is to say the well-known two parameter (a, 9) Chinese 
restaurant process, then as n —>■ oo, n~°‘K-n almost surely. There are of 

course other known as ymp totic results involving the number of blocks of a cer¬ 
tain size etc. Following (dll dd, d^, a version of 5'“^ may be interpreted in terms 
of the local time up to time 1 of a generalized Bessel process. The following 
relation shows that Sa,e is a measurable function of the (Pfe) ^ PD(a,0) which 
makes sense of conditioning iPk)\Sa ,9 = t; Sa,e ■= limi_>.oo (iF(l — a)Pi) 
almost surely. For general a, they also arise in various Polya urn and random 
tree growth models as described in for instance in [ 2 ^, [s^ . There are however 
instances where the limit is not recognized. 

We also note that 5*^/2 e = 2Gg^i,^, where Gg+ 1/2 denotes a Gamma(0 -I- 
1/2,1) variable. Random variables that are powers of gamma variables have 
played a key role in recent work by 15, sl, 3^ d^, ^|. One can surmise that such 
results for a = l^can possibly be adapted for the general a case. Indeed, the 
works of [3 El Ho, [IS 1311,111 have shown that Sa.g satisfies many interesting 
distributional identities demonstrating a notion of a beta-gamma-stable alge¬ 
bra. However, for example, one important problem considered in Pekoz, Rollin 
and Ross [11 are results related to the scaled limiting distribution of the joint 
degree distribution of linearly weighted variations of the Barabasi-Albert d 
preferential attachment model. This requires quite specific information about 
the joint distributional behavior between variables in the limit. This is consid¬ 
erably more challenging in the general a setting. Although of interest, our aim 
in this present setting is not to mimic or adapt the methods in 3 3 il , but 
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rather to provide more details and indicators on what types of limits might arise 
in various models, in n, having random limits when scaled by n“. 

Let Ba,b denote a Beta(a, b) variable. James 3lj notes there are versions of 
the generalized Mittag Leffier variables that satisfy the following exact equality 
for any 9 > —a, 


Q — Ct Q — CX TDOi _ Q — OL TD 

^a,6 — ^a,e+a^(e+a,l-a) “ ^a,0+1^,1^) 


(1.3) 


where independent of 


) and Bf^e+a is independent of 
Sa, 0 +i. [See [30|, eq. (2.11)] for an in distribution version of this result applied 
to a wider range of parameters]. By recursion, this leads to two sequences form¬ 
ing Markov chains. For an integer r > 0 an interpretation, in terms of size biased 
deletion of excursion intervals of certain generalized Bessel bridges, of the first 
such sequence (SZn | „„.)T->n}, as is well known, may be read from Perman, Pit¬ 
man and Yor [4ll Corollary 3.15]. Perhaps more simply, the sequence represents 
a dual Markov chain corresponding to the operation of size biased deletion and 
insertion as described in Pitman and Yor|49l Proposition 34 and 35]. The se¬ 
quence encodes such operations relative to a nested family of (PD(a, 0-|-ra))j.>o 
distributions. Reading for r increasing describes the states relative to a deletion 
operation. 

Our primary interest is in the second sequence (>S'“g_|_j,){r.>o} which encodes 
Markov chains for the following family of distributions (PD(q;, 9 + r))r>o- Specif¬ 
ically this is encoded by the recursion formed from the equality, 




(1.4) 


The family can be seen to coincide with discrete dual fragmentation coagula¬ 
tion operations described in [3,El, although the particular role of the sequence 
e+,.){r>o} is not emphasized. These authors, as mentioned in [2l|, Remarks 
p.l712], do cite relations of their constructions to random recursive trees and 
other trees and graphs constructed under preferential attachment. One could say 
that (5'“g^^){r>o} is a family of generalized Mittag Leffier distributions under 
a PD(a, 9) discrete coagulation or fragmentation regime. We shall simply refer 
to (S- e+r.){r>o} ns a PD(a,0) sequence. The states (PD(a, 0-I-r))j.>o, read for 
r increasing correspond to fragmentation schemes. 

The PD(a, 1 — a) case arises in Haas, Miermont, Pitman and Winkel 26[. 


There the sequence “_|_i_tj){r->o} is interpreted as increasing lengths of nested 
families of trees. The general Markov chain associated with (S'“g^^){r>o} sub¬ 
ject to a change of measure, is presented in James [El which we shall reproduce 
here. One could also deduce this from [^, Proposition 18], by a change of mea¬ 
sure, since their result involves any a. Subsequent to these, the Markov chain 
based on constructions in Haas and Goldschmidt 2^ involves a PD(a,a) se¬ 
quence for a < 1/2. We note further that the M and L preferential attachment 
models in [sj correspond to the case of PD(l/2,0) and PD(l/2,1/2), respec¬ 
tively. 

Both sets of Markov chains are well defined when conditioned on Sa g = t, 
leading to sequences governed by a PD(al<), law as defined in Pitman 4^, 44j. 
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That is if {Pk,o){k>i} is the mass partition having law PD(a,0), then its condi¬ 
tional distribution is PD(a|t), the distribution of the families i{Pk,r)k>i){r>i} 
in the Markov chain are then determined by transition rules of known form. The 
PD(l/2|t) case has special cancellation properties, which under the regime of 

_ 1 /Q 

the first sequence {S 1/2 e+r/ 2 ){r>o}t translates into constructions for the stan¬ 
dard additive coalescent and Brownian fragmentation processes in ii El 13. 
Here we shall present some details for the PD(l/2|t) sequence under the second 
regime. 

For general a, these conditioning arguments now allow one to mix t rel¬ 
ative to any non-negative distribution, where the mixing distribution can be 
expressed as h{t)fa{t) for any non-negative function h{t), such that ]E[S'o,] = 1. 
Thus we say {Pk){k>i} has the distribution PKq,(/i • fa), if PKq,(/i • /„) = 
PD{a\t)h{t)fa {t)dt. This distribution is a Poisson-Kingman distribution based 
on a stable subordinator with mixing distribution h(t)fa{t) as defined in Pit¬ 
man [H,El- PD(a,0) arises by choosing h{t) = t~^Ca, 9 - 


1.1. Outline 


The paper will now progress as follows. In section 2 we will present a detailed 
description of the limiting joint degree distribution of preferential attachment 
models considered by Mori [37| and others. In section 3, we will present a formal 
description of the pertinent Markov chain under general PKq,(/i • fa) distribu¬ 
tions models. This parallels Pitman, Perman and Yor El|, Theorem 2.1] in the 
a-stable setting. Section 4 describes nested families of random partitions of [n] 
determined by a PD(a,0) sequence of Chinese restaurant processes. We intro¬ 
duce and obtain some distributional results for an interesting class of variables, 
in n, (^ra, 0 ) • ■ •) ^n,r)- These can be seen as mimics for vectors of degree sequences. 
We further describe some joint limits where the idea is one can then map to 
various constructions of trees and graphs. By a change of measure these results 
can extended to any PKq,(/i -fa)- Hence, this allows one to describe limits for 
models based on nested sequences of general Gibbs partitions of [n], [ilElEl- 
We present relevant calculations for this general setting in section 4.2. We par¬ 
tially view these contributions as helping to provide a blueprint for construction 
of models having more flexible properties as demonstrated by their limiting dis¬ 
tributions. We would add that it seems quite unlikely that one would be able to 
characterize (recognize) such limits by the usual methods. In Section 5, in terms 
of practical implementation, we can consider all possibilities in the a = 1/2 
case by obtaining explicit results for PD(l/2|t). Section 6 describes how to fur¬ 
ther embed/nest PD(q;(5, 9) nested schemes into PD(a, 9) nested schemes for any 
0 < (5 < 1, which in some sense offers a coalescent version of the results of [la. 

Remark 1.1. More details and related results in terms of basic properties of 
(>5'(/e_|_,r){r’>o} discussed in the unpublished manuscript of Jamesf^l. See 

there for connections to models where h(t)fa{t) = / Oi]te~^^ ' d'fait). 

The entire range ofPD{a,9), for 9 > —a, is obtained by randomizing f to have 
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a Gamina(^^, 1 ) distribution. 


2. The explict joint degree distribution of a class of linearly 
weighted graph and ^-recursive tree preferential attachment 
models 


We now describe the limiting joint degree distribution of the linearly weighted 
preferential attachment graph model obtained in Morils^- [See Athreya, Ghosh 
and Sethuraman for a more general extension]. Bertoin and Uribe-Bravo[l^. 
note that the model of [s^ is equivalent to certain recursive tree models as 
discussed for instance in Devroye [IS], section 5], As such, we lift the description 
of the scale free tree construction given in [1^. 

Fix /3 > —1, and start for n = 1 from the unique tree Ti on {0,1} which has 
a single edge connecting 0 and 1. Then suppose that T„ has been constructed 
for some n > 1, and for every i G {0,1,... ,n}, denote by d„(i) the degree of 
the vertex i G T„ Conditionally given T„, the tree T„_|_i is derived from T„ by 
incorporating the new vertex n + 1 and creating an edge between n + 1 and a 
vertex Vn G Tji chosen at random according to the law 

P(u„ = i|T„) = for i G {0,1,..., n} 

2n + p{n + 1) 


For reference we shall call these models jd—recursive trees. 

Let ^ denote convergence almost surely. From [1, 37| one has for any r > 0, 
that as n —>■ oo the joint vector 


... ,d„(r)) (^ 0 , 6 , • ■ ■ ,^ 0 , ( 2 - 1 ) 

where (^o,...,^r) has joint moments specified in Mori [^. Furthermore an 
important result for the scaled maximal degree is obtained 


See also Durrett [22| and van der Flofstad Section 8 ]. These results corre¬ 
spond to the model of Barabasi and Albert [ 6 | when /3 = 0. 

We will now show this model corresponds to components of a PD(a, 1 — 2a) 
distribution. From dm and (O, for each fixed j > 1, the density of Sap- 2 a 
is given by fa,j- 2 ait)i and furthermore 


r(^ + k)T{j + 1 - 2a) 
a.,-2aJ r(^)r(j + l-2a + fca)' 


Proposition 2.1. Set jda = 1 —2a > —a, and let (•S'o, “_|_i_ 2 a){r>o} denote the 
sequence o/PD(a, 1 — 2a) a-diversities satisfying the recursive identity 


q — a _ q — a p 


( 2 . 2 ) 


where Bj = <5'q, “_ 2 q,/'S'q, [^+i_ 2 a mutually independent Beta('2-^, i-^) ran¬ 
dom variables. Furthermore (i?i, ... ,Bj) is independent of 5'~“_2a for £ > j. 
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(i) Then for every integer r > 0, the joint distribution of the sequence 

/ Q—ct Q—a Q—cn Q—a Q—cn \ / 

—2q:’ —2q: —2q;’ ' • ’ t oc,r-\-l — 2a ‘^a,r—2ct) v 

is equivalent in distribution component-wise and jointly to the vector 

(^Oi ■■■! Cr) 

in 112.1]) . 

(a) It follows that, for Sa,- 2 a '■= 0. 


n “maxd„(i) max(S'„ 
2>0 2>0 


q-a \ 
^Oi,i-2a)’ 


(Hi) One may set fo = S^‘(_ 2 a o-n-d = S^ 2 - 2 a ~ ^a‘i- 2 a- Then noting that 
Bi is a symmetric Beta(i^, random variable, there is the distribu¬ 
tional identity 


6 = [1 - B,] ^ S-%_2^B, ^ Co 


(2.4) 


(iv) Note for {Sa, 9 , Sap+g) in aPD(Q;,0) sequence, the correspondence in 
only holds for the case 9 = 1 — 2a. 

Proof. One can verify [(i)] by checking that the joint moments of the vector 
in (12.3p correspond to joint moments of (CqiCij ■ • ■ j f,r) provided in [37|. However, 
while true, this is rather tedious. Mori(37j, Lemma 3] shows that for each j, 


(Co + • ■ • + fj)Bj = (Co + • ■ • + Ci-i)? where Bj has the same beta distribution 
as in the PD(a, 1 — 2a) sequence and (Co + • • ■ + Cj) is independent of Bj. Due 
to scaling, there are a myriad of potential solutions for the (Cj). Nonetheless 
the recursion in (1^ establishes the result if one can show that one can set 
?o = . This is true since ^a] = iE[Co"]■ □ 

Remark 2.1. The equivalence in distribution o/(Co)Ci) Provosition \2. 1\ was 
noted in f^ l. and is otherwise evident in the description of the j3-recursive tree. 
The result in shows the only PD(a, 9) case we could have considered is 

9=l-2a. 


Remark 2.2. The case /3 = oo corresponds to a —>■ 0 which, by continuity. 


lim PD(a, l-2a) = PD(0,1), 

a—>0 

yields the Poisson Dirichlet model PD(0,1). The rates become log(n) and fj = 1 
for all j > 0. 

Remark 2.3. f^l first provide an explicit description of these limits when 
(3 = 0, that is a = 1/2 and hence the PD(l/2,0). 
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3. PKa(h, • fa) Markov chains 


We now present the formal details of the Markov chain for under 

PD(q, 9) and under a general change of measure to PKa{h • as described 
in [31|. As was noted earlier, one can also deduce this from [26|. It suffices to 
work with the basic case of a PD(a, 0) sequence. Note from the recursion, there 
is the identity 

r 

5-0= n (3.1) 

k^l 

where here Bk are independent Beta( “^^~^ , variables independent of Sa,r- 
Proposition 3.1. For each r, let (Ta,o-,Ta,i, ■ ■ ■ ,Ta,r) denote a vector of ran¬ 
dom variables such that Tafi = Sa and there is the relationship for each integer 
k 

TaXk-l) = Ta,k X (3.2) 

where 14 has a Beta( °~*~^~^ , ^^-^) distribution, independent ofT^^k o-nd marginally 

Ta,k = Sa,k- Then, the conditional distribution of Ta,k given Ta,k-i = t is the 
same for all k and equates to the density. 


P{Ta,i G ds\Tafi = t)/ds 


{s/t)°‘ {s/t)°‘)‘'^F ^fa{s) 

t^ut) 


for s < t. By a change of variable v 
by 


P{Vi G dv\Tafi = t)/dv = 


= {s/t)°‘ the density o/I4|Ta,o = t is given 
a {l-v)^-^fa{v^/°‘t) 

r(i^) tut) ■ ^ ^ 


Furthermore (14,..., 14) O'Ce independent variables, independent of Ta^r- The 
sequence is a Markov chain, governed by a PD(a, 0) law. 


Proof. Because of the independence between 14 and Tq, ^ the proof just reduces 
to an elementary Bayes rule argument. Details are presented for clarity. The 
distribution of Tc,,k-i\Ta,k = s is just where 14 Beta( “^^~^ , -^^). 

Use the fact that for each k, Ta,k has density 


fa,k{s) 


r(fc + i) 

P(^) 


S ^Us), 


to show that the joint density of Ta^k-i,Ta^k) is, 


a‘^r{k)t 4'+i) 

P(^)r(i^) 


is/u-\i-is/U)^-^Us). 


(3.5) 


Now divide (13.51) by the fa,k-i{t) density of Ta^k-i, to obtain (13.31) . The Markov 
chain is otherwise evident from the exact equality statement. □ 
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Corollary 3.1. As consequences of Proposition\3A\the distribution of the quan- 
tities above with respect to a PKaih ■ fa) are given by and specifying Tap 
to have density h(t)fa{t). 

(i) In particular, the joint law of (Vi,..., Vf, Ta,r) is given by, 


]1 fBkivk) 

.fc=i 


h{s/Y\_vl^°‘)fa,ris)ds 

1=1 


(3.6) 


where fs^ denotes the density of a Beta( °'~*~^ variable. fa,r{s) = 

Ca,rS fa(.s). 

(ii) It follows that the conditional distribution ofTa,r\Vi,... ,Vr is proportional 
to 

(Hi) Relative to Ta,r, for each j = 1,2,... ,r 


1=3 

Remark 3.1. The fact that the quantity in 6\) integrates to 1, follows from 
the identity EH). Which reads as 




r 



= 1 . 


Remark 3.2. Comparing LI.3\} with Haas, Miermont, Pitman and Winkel 
Proposition 18, (ii), (Hi)] shows that under a PD {a, 1 —a) model, where for each 
k = 1,2,...; Ta,k-i = Sa,k-a, "Efe-i oQuates to the total length ofW^'^, say 
= T~^_^. Where is a member of an increasing family 

of leaf-labeled R-irees with edge lengths, arising as limits in Ford’s sequential 
construction. It follows from LI.2\) that, in this setting, (14) can he interpreted 
as 


Vk 


-=-;- — IDck l-a\ 


which is independent of II)(7^^’^]^) = T~‘^ = In fact (14, ■ • ■, 14) ore 

mutually independent and independent of . See f^I for a more precise 

interpretation of See also Fiji/ for related discussions involving fragmen¬ 

tation by PD(a, 1 — a) models. 


Remark 3.3. Note there are other distributions besides PD(a,0) that may pro¬ 
duce the same sequences (5'4e_|_^){r>o}- From a point of view of wide applicability 
of our results, this is rather fortunate. A key word in our exposition is version. 
The explicit constructions via bridges in fshl or the analysis of Fi3/ . in the 
PD(a, 1 — a) case, already verifies the existence of the appropraite versions of 
variables we identify via Corollarv \3.1\ with respect to a sequence of mass parti¬ 
tions ((i4,r){fe>i}){r>o} following a sequence of laws determined by PK„(/i-/a). 
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In the next section we will work with characterizing features of such families. 
Namely nested random partitions of [n] derived from the appropriate Chinese 
restaurant processes. 

4. PD (a, 6) nested Chinese restaurant processes 

For any fixed r > 0, set {Pk,r){k>i} ^ PD(a, 0 + r), and independent of this let 
{Uk,r)k>i be a collection of iid Uniforni[0,1] random variables. Then the random 
probability measure Pap+r{y) '■= ^k.r^iUk is a PD(a,0 + r)-bridge, 

also known as a Pitman-Yor process as coined in [29|. For j = 1,... ,r one may 
set Bj = which are independent Beta( variables 

independent of Pa,g+r- Let U(j/) = ?/ S [0,1] denote a Uniform[0,1] cdf. Now for 
each j define independent simple bridges 

X,{y)=BMy) + (.^-Bj)\u^<y} (4.1) 

Then the coagulation operation in can be encoded by the compositional 
identity, for each r > 1, 


Pa,d+r—l{y) — Pa,9+r{Xr{y^^ — Pa^g+r^B^y) 

where Pa,g+r{I — Br) has a Beta(l — a ,9 -\- a + r — 1) distribution. Further¬ 
more, one can show that Pap+r{Bry)/Pa,9+riBr) = Pa,9+ra{y) independent of 
Pa,g+riBr). More generally, for any r > 1, 


Pa,g{') — Pa,g+r Xr O • • • O Ai(-). 

It follows that if F~^ denotes a possibly random quantile function, then for 
every r > 1 

o ... o A-i O P-}+,i-). 

We shall use these properties to construct nested sequences of Chinese restau¬ 
rant process partitions of [n1 = {1, 2,..., n}. Note a dual fragmentation pro¬ 
cess can be deduced from [2l| which will produce nested partitions with the 
same distributions in reverse order. For a Chinese restaurant process following 
a PD(a,0 -|- r) distribution, the sampling scheme proceeds as follows. The first 
customer with index { 1 } is seated to a new table Ai^r- After n customers arrive 
in succession, a partition of [n], .. ■AKr,,r,r) where Kn^r < n are the num¬ 

ber of distinct blocks in the partition, and Ni^r = \Ai^r\ are the sizes of each 
block, is produced. Given this configuration, customer {n + 1} is seated to a 
new table with probability {9 + r + Kn^rO:)l{0 -I- r -|- n) and sits at an existing 
table Ai^r with probability {Ni^r — oi)/{9 -I- r -|- n), for z = 1, ..., Kn^r- We now 
describe the combinatorial scheme we have in mind. 

Nested PD(a, 9) partitions of [n] 

(i) For any r > 1, draw a random partition of [n], {Ai^r, ■ • ■ > from a 

PD{a,9 -I- r) Chinese restaurant process scheme. 
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(ii) Draw {Ui j., ■ • ■, ^ iid Uniform[0,1] variables. 

(iii) Recall that Ur is the atom of A,., and has a Uniform[0,1] distribution. A 

PD(a, 9 + r — 1) partition of [n] ■ • ■, is obtained as 

follows. Blocks of (Ai_r, ■. ■, AKr,^r,r) are merged into a set defined 

as 

A'l,r-l={A^,r-X-\Ulr) = Ur}, 

if is not empty, set Ai^r-i = ^_i,the remaining — |^i r-i \ — 

— 1 1 blocks of ,..., : are relabeled A‘ 2 ^r ‘— i ^ ■ • ■ ? 7 -—. 

If A'l = 0, Kn^r-i = Kn,r and one sets A^^r-i = ^fc,r for k = 

1 ' K 

X, . . . , 

(iv) Repeat steps [(ii)] and [(hi)] for r—1, r—2,..., 1 to obtain nested partitions 
of [n] following PD(q!, 9 + j) marginal distributions for j = 0,... r. 

Remark 4.1. Kuba and PanholzerJsA. Proposition 3] point out that partitions 
generated by the PD(Q!,d) Chinese restaurant can be equally generated by the 
growth process of generalized plane-oriented recursive trees. As such, some vari¬ 
ation of our scheme can be used to produce nested version of such trees. 

Remark 4.2. Note in the general PKQ,(h • fa) setting, one would use simple 
bridges defined as 

^k{y) = VkU{y) + (1 — 

for Vk = These are the same entities subject to a change of mea¬ 

sure, where generally independence no longer holds. 

Remark 4.3. It is a simple matter to show that Pa, 9 +r{-) converges almost 
surely to U(-) as r ^ oo. Thus implying that A^ o • • • o Ai(-) converges almost 
sure to Pa, 9 . See James Fil . Section d.j and Proposition 6.6] for distributional 
results related to the composition of bridges Xj. o ■ ■ ■ o Ai(-). 

4.I. Mixed Binomial distributions, p-mergers and [J-splitting 

Note the nested scheme described above provides nested versions of all the 
statistics generally associated with random partitions, and appropriate limits. 
For brevity we shall only concentrate on results for the sequence of the number 
of blocks {Kn,r){r>o}- Let Bin(m,p) denote a Binomial distribution based on 
TO Bernoulli trials with success probability p. We now describe results for Kn^r- 
The first result is immediate from the description of the nested PD(a, 9) scheme. 

Proposition 4.1. For every r > 1, consider the the blocks [Kn^,..., Kn^r) 
produced by a nested PD(a,0) scheme. It follows that for each n Knj-i < Kn,j 
for j = 1 ,... r with properties; 

(i) For j = 0,... ,r the marginal distribution of each Knj is exactly that of 
the number of blocks of a PD(a, 9-\- j) partition of [n[. For j = 0,..., r — 1 

Kn,j = {Kn,j+1 - \ A'i^j \ + l)I{|Ai^j|> 2 } + 
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(a) For each j the conditional distribution of j_i| given (Kn,j,Bj) is 
I - Bj) 

Note that Proposition 14.11 shows that in step[(iii)] of the nested PD(a, 9) 
scheme one is repeatedly performing some sort of pa,e+j = Bj merger in the 
language of Berestycki [71, p. 69-70]. That is £ of the {^ly,. ■. blocks 

are said to coalesce if = i > 2. The next results, which follow from 

elementary calculations, describes some more details about the distributions of 
j_i \ and random variables (^n.Oi > ^n,r) we define as follows. Set 

£.n,0 = Knfil{\A'^ „\>2} = {Kn,l - |>llol + l)I{|^'i.ol>2} > (4-2) 

and for j = 1,..., r, define, 

Craj ■= BinJ — KnJ-l = ^ | >2} (4-3) 


As the notation suggests these are meant to be thought of as mimics for degree 
sequences. Write the Beta(4^, density. 


Pa,e{v) = 




r(^)r(4^) 




Proposition 4.2. In the PD(a, 9) setting of Proposition [7T7] the general distri¬ 
bution of \A'-^ qI, given Kn,i = b is a mixed Binomial distribution Bin(6,1 — Si), 
where 1 — Bi is a Beta(4^, random variable with density function pafi{v). 
Hence the probability mass function of \A'-y oli 


Pa,e{^\b) 


A\ r(iM)r(g^ + 5-£)r(i + ^-i) 
W r(^)r(4^)r(4M + 5) 


Furthermore the distribution of \A'i g|, given Kn i = b, \A[ qI ^ 2, is for 2 < £ < 

b, 


Xc.,em 


_ Pa,si£\b) _ 

1 - Pa,9{0\b) - Pa,e{Mb) 


(4.4) 


The corresponding conditional distributions of the random variable Knp — 
IqI can be expressed as p'^g{£\b) := Pa,e{b — £\b) and A)!) g{£\b) := Xa,d{b — £\b) 
respectively. Replace 9 with 9Fj — l to obtain corresponding results for |A'j^ 
given Knj 


(i) In the Brownian cases, PD(l/2,d), 9 > —1/2, 


Pl/2,9Wb) 


{29 +1){29 + b - ey.bi 
{29 + b + T)l{b - £)\ 


In particular pi/ 2 ,o{j\b) = l/(&-|-l) is the discrete uniform distribution on 
{0, ...,&}, and Pi/2,i/2{£\b) = 2(6 -f 1 — £)/[{b + 1)(6 -|- 2)]. 
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(a) In the limiting Dirichlet case, PD(O,0), 

PofiWb) = - P9f~^ 

is a proper Binomial distribution with success probability pg = 1/(0 + 1), 
for 0 > 0. 

Note starting from some PD(a,0 + r) it follows that every layer of our 
PD(a, 0) nested scheme produces proper consistent infinitely exchangeable par¬ 
titions in [n]. As such, one may view our scheme, as discrete time coalescent pro¬ 
cess based on a sequence of merger rates determined by measures 
where from \a,g{I\b), in (14.41) . 


Aa,g{dv) 


r(^) 

r(^)r(i^) 




For a fixed time, these schemes are suggestive of relations to a class of Aa^g- 
coalescents where Aap corresponds to a Beta(i^, ^^)-coalescent. However 
our models should not be confused with such processes. Rather, our models are 
also identified as continuous time coalescent processes by defining as waiting 
times ...) in reverse order as written, 

and viewing our exchangeable bridges Pa,g+r{-) '■= ^ 5 -“ (•) as, stochastic flows 

of exchangeable bridges, , (JA t > ^ in the spirit of the simple bridge construc¬ 
tions of Bertoin and LeGall [91 [llj,[l^ and their connection to Fleming-Viot pro¬ 
cesses. However, our constructions of the analogous simple bridges are quite 
different as the y a re based on ratio’s of local times, {Br = 

See remarks in [l^ for the PD(O,0) case. This particular identification of wait¬ 
ing times, is, as we shall explain next, hidden in the literature. Again all such 
interpretations hold in greater generality under PK^ih -fa)- Placing things in 
reverse increasing order, dual fragmentation processes of partitions of [n] can be 
obtained by appropriate splitting rules. We shall not attempt to formulate this. 
However one notes that with respect to mass partitions induced by our schemes 
there is a natural dual connection to the fragmentation schemes described in [ 2 ^ 
pages 1831-1832]. Formally, their Proposition 18 and Corollary 19 describe the 
same waiting time structure, in the form of tree edge lengths ^0 := and 

£,r ■= S'//for r > 1 , interpreted in terms of renewal sequences in 
the PD (a, 1 — a) setting. Upon a change of measure, this makes sense for our 
nested schemes in the general PKq,(/i • fa) setting. We will describe some more 
details for general a in section 4.2 and very explicit details in the PD(l/2|t) 
case in section 5. We next establish a connection with Aldous’s /3-splitting rule 
for /3 > —1. 

Proposition 4.3. The conditional probability mass functions {pafi,p'^ g), and 
hence also (Aa.e, A/( g), are equivalent if and only if 0 = 1 — 2a. That is Bi = 
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(1 - Bi) - Beta(i^, Where /or 2 < £ < 6, 

^b\ r(2^)r(i^ + b - ^)r(i + £ -1) 


K.l-2a{£\b) OC Pa,l-2aii\b) = 


r(^)r(i^)r(2^ + 6) 


(i) Setting fia = 1 — 2a > —a it follows that 

Pa,i-2a{i\b) OC gf forl<^<b-l, 

equating to the splitting kernel in the S-splitting model of Aldous 0 / for 
the range /3 > —1. Here we use the notation in 12a . p.l824], 

(ii) Hence, the Yule model case of jd = 0 corresponds to a PD(l/2,0) model 
with Pi/2,o(^l^) = l/(^+ !)■ The symmetric random trie case /3 —>■ oo 
corresponds to a PD(0,1) model with pQ^i{£\b) = (^)(l/2)^. In particular, 
for b>2, 

~Aidous- 0 (^) = ^ and “(£) = (^) 

Remark 4.4. For more on A-coalescents see 0, Ql, f^/ - 

Remark 4.5. See 01 . [^ . [s^/ /or more on theB-splitting model and its con¬ 
nection to fragmentation trees. In addition See fA aJ . Section 3.3,and Propo¬ 
sition 27], Fia . Proposition 18] and fill , p. 1738] which describe relations to the 
a-model of Ford 03/ and the Brownian CRT of Aldous UM]- 


4 . 2 . Calculations for general PKa{h • /„) nested Gibbs partition 
schemes 

As we have discussed above, viewing in reverse order, as times 

where mergers occur gives the formalism to recognize our nested scheme as 
continuous time coalescent processes based on a sequence of merger determin¬ 
ing measures (Aa,e+r){r>o}) that produce, at each merger time, consistent ex¬ 
changeable partitions of [n]. Conditioning on Sa^e = t, gives in a distributional, 
rather than operational, sense the relevant quantities emerging under PD(a|t). 
schemes. Operationally, since consistent families of Chinese restaurant parti¬ 
tions are produced under this setting, one carries out the same nested scheme 
under a PD(a|t) distribution with time points T“q = t~°‘ and ('P,/“){r>o}- 
PKq(/i • fa) nested schemes proceed in a similar fashion. Thus all producing 
coalescent schemes which are based on sequences of merger rate determining 
distributions (1 — V/) whose distributions generally depend on the number of 
the blocks considered at the time where mergers are to occur. The exception to 
this are the PD(a,0) distributions, which represents the only instances where 
the pairs (14, Ain,/c) are independent. We now describe more explicit details of 
these distributions. 
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Recall from Proposition 9], see also [24l44|. that the EPPF of a PDlaltl 
partition of [n] with generic blocks {Ai ,..., Ak^), respective block sizes nK„ '■= 
(ni,..., riK^), and Kn = b blocks, can be represented as 


Paini,...,nb\t) = n TTT-^ 

r(i - a) 

where = (r(n — ba)fa{t))~^a’^t~^ f* fa{t — ?;)(1 — v)'^~^°‘~^dv. While 

the expression for seems to be generally intractable, the work of Ho, 

James and Lau shows that this quantity may be expressed in terms of 
special functions corresponding to Fox-H functions in the general a setting and 
Meijer-G functions in the case of a taking on fractional values. Thus extending 
the observations of Pitman for the case of a = 1/2. Furthermore, they 
show that = (r(n)/„(t))"^r(6)a'’"Va.(n,6)(0i where fa,(n,b){t) is the 

density of the product of independent random variables. 


^ ^ (4.6) 

which interprets as the conditional density of Sa,o\Kn = b when the pair 
{Sa,o,Kn) := {Sa,o, Knfi) are interpreted with respect to a PD(a, 0) distri¬ 
bution. Note the identity on the right hand side of (14.6p follows fromj^, eq. 
(2.11)][see also [i^]. This leads to the identity fa{t) = YlLi ^a,o{Kn = b) fr, i-„ ^ ft), 
where ¥ap{Kn = b), denotes the distribution of Kn under PD (a, 0) given in [4^ 
or otherwise deduced from (14.5|) . Hence 


Pa{ni,.. .,nt\s) 


fa,{n,b) (s) 

/a(s) 


Pap i'bbl 5 • ■ • 5 'bib} 


(4.7) 


where Pa,oin-i, ■ ■ ■, bib) is the EPPF under PD(a, 0), and 


^a{Knp = b\Sc.p = S) = ^^^pp^FapiKn = b) 

fa[s) 

It follows that by integrating (14.511 with respect to h{s)fa{s), that for a general 
PKq(/i • fa) distribution its EPPF, and distribution of Kn, can be represented 
as 


p[!f\nb) = Wn,b X Pa,o{nb) and F^a\Kn = b) = Wn,b X ¥a,o{Kn = b) 

where, as in [^, Theorem 4.1] suppressing dependence on h, Wnp can be ex¬ 
pressed as 

Wn,b = 'Fap[h{Sa,o)\Kn,0 = &] = E[/l(Ea_„ (6a))]. 

See also Gnedin and Pitmanj^ for other representations. The next result fol¬ 
lows from our discussion. 

Proposition 4.4. Suppose that the law of {Knp,Tap) is determined 67 /PKa(/i- 
fa)- Then, 
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(i) the joint distribution of may be expressed as 

Pa.0(J^n = X h{s)Uis). 

fofs) 

(ii) Hence by Bayes rule the conditional distribution of Ta^Q\Kn,o = b is given 
by 

= ^nlKs)fc,,(u,b){s)- 

(Hi) Statement [(ii)] implies that for any integrable function g 

[g{Tap)\Knp = b] = ^o!,o[giSap)h{Sa,o)\Knp = b] 


where Eq,_o denotes expeetation relative to the distribution of {Kn,o, Sa,o) 
defined under a PD(a, 0) distribution. This may be also expressed as 

= b]= E[g(S„.„(6a))/i(E„,„(6a))]W^-i 

Proof As we mentioned, the first two statements are just simple consequences 
of our description of the conditional EPPF. Statement [(hi)] follows by a simple 
manipulation of s“®/Q,_(n,_f,)(s). □ 

We now specialize to the important PD(a, 9) case. This result was originally 
deduced by |28l |. 

Proposition 4.5. Suppose that the law of {Knp, Tap) are determined by PD(a, 9) 
case where h{s) = s~^Ca,g. Henee one may set Tap = Saft- It follows that 


Wn,b = CaP^^a%{ba)] 


r(n)r(e + i)r(g^) 
r(0 + n)r(6)r(^) 


and [s ^Ca,dfa,{n,b){s) eorresponds to the conditional density of the random 
variable Sa,d\Knp = b, which is eguivalent in distribution to, 


^a,n{0 + ba) — Sa,e+baBg^j^^^^_i,^ — Sa,n+eB^ _gy 

We now present the main result in this section which follows from these facts. 

Proposition 4.6. Consider the variables described in Provoosition \3.1\ (Tf)!) f^^^r,\. 
and consider the nested PD(a, 9) scheme where mergers and hence consistent 
partitions of [n] occur at time points for r decreasing. Then condition¬ 

ing on Sa,e = t, the law of the nested schemes are determined by a PD(a|t) 
distribution, and in fact are equivalent in (conditional) distribution to a nested 
family of partitions that was initially constructed from anyPKa{h-fa) distribu¬ 
tion. Operationally, under PI)(a\t) the consistent family of nested exchangeable 
partitions can be generated for any r > 1 by generating a Chinese restaurant 
process partition of [n] according to the Poisson Kingman law defined by mix¬ 
ing PD(a|s) relative to the density of Ta^r\Tap = t, which can be determined 
from Proposition \3.1\ Denote this density as = hr(s|t)/a(s) and the 
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corresponding Poisson Kingman law as PKa{fa)r) This represents the config¬ 
uration of the partition of [n] at the time T~f. Mergers in the past occur at 
times {T~_f [,..., T~q)\T~q = according to simple bridges constructed from 
{Vr ,..., Vi)|rQ,_o = t, producing partitions of [n] with marginal laws determined 
by a PKa(/^*^) EPPF for j = r — 1,..., 0. The distributional properties of the 
relevant quantities in Proposition f.l are now described. 

(i) For each r > 0, the conditional distribution of Kn,r given (Ta^r, ■ ■ - Tap) 
only depends on Ta^r and is given by 

^c.{Kn,r = h\T^^r = s) = f = h). 

U{s) 


(ii) given {Kn,j,Vj,Taj-i,.. .Tap) has a Binomial(Xrij, 1 — V^) dis¬ 

tribution. 

(Hi) For any integer j, the distribution of {Knj, Vj)\{Taj-i = s,... Ta,o = to) 
only depends on Taj-i = s and is the same in form as the case where 
j = 1, with Tap = s. 

(iv) From [(i)] and using jS.fll, the joint distribution of{Kn,i = b, Vi))|rQ,_o = t 




fa{tv^i<^) r(i^) 


tfa{t) 


which reduces to 


ypi^n — 


r(i^ 
^ a 


tfa{t) 


(v) Noting, from Proposition \f.5\ thatt ^fa,{n,b)(t)/^[T^ami^O')] is the density 
of + ha) it follows that 


v)^-^fapnP){tV^^‘^)dv = E[I]-lj&a)]/y„,„(,)(t) 
where fy^, „(b)(i) denotes the density of the random variable, 


(vi) Ya,n{b) is the random variable corresponding to the conditional distribution 
of Sap\Kn,i = b, where {Sap, K^p, Kn,i) follow a PD(q;, 0) distribution, 
and otherwise Sap = 

(vii) It follows that the conditional distribution of Kn,i\Tap = t is given by 




=t) = Fap{Kn 


b) 


(l-a)E[S-UH]/r.,„(b)(t) 

r(^)/aW 
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(via) Hence, the conditional density of (1 — Vi)\Kn,i = b,Tafi = t, is given by, 
for 0 < p < 1, 


p ‘^Aa{dp\b,t) 


(1-a) E[I]^i„(&a)]/y^_„(b)(t) 


which is the same as the distribution of {1 — Vk)\Kn^k = b,Ta^k-i = t, for 
each k > 1. 

(ix) It follows that the distribution of\Ai g| given = b,Tafi = t is specified 
by its probability mass function 

P„(|A;^oI = = b,T^,o =t)= Q j\^-\l-pf-^KMbA)- 


We close this section with the following corollary. 
Corollary 4.1. Suppose that Tap has density h{t)fa(t) then, 
(i) the conditional density of Tap\Kn,i = b is 


U,K„At\b) 


^( 0 / f „,„( 6 ) (t) 

E[h{Ya,n{b))] 


(a) It follows that the conditional density of (1 — Vi)\Kn^i = b is given by, for 

0 < p < 1, 


p '^Ka{dp\b) 


+ 5a)(l - p)-i/“)] 
(1-a) E[h{Ya,n{b))] 


(Hi) The conditional density of {1 — Vi)\Kn,i = b can also be expressed as, for 

0 < p < 1, 


p '^Aa{dp\b) 


E„,o[fe(5„,i(l = b] 

(1 - a) Ea,o[h{Sa,o)\Kn,l = b] 


(iv) Hence when h{t) = t~^Cafi, corresponding to the PD(a, 0) case, the condi¬ 
tional distribution of {1 — Vi)\Kn,i = b has a Beta(^-^, distribution 
independent of Kn,i- 


4-3. Limits of joint vectors under the nested PD(q;, 0) scheme 

Our constructions allow one to utilize well-known results to easily establish limit 
theorems. We shall present one such result here. We again note that these results 
hold in the more general PKq,(/i • fa) setting 

Proposition 4.7. Let (S'~g+r){»’>o} denote the PD(a,0) sequence assocated 
with the nested scheme. For each r > 0 let {Kn,r), i^n,o, ■ • ■, in,r) be as previously 
defined. Then 
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(i) As n ^ 00 , jointly and component-wise, for any r > 0 

n-“(K„,o, • • •, J^n,r) “4- (5-^,..., 5-;+,) 

ftij As n —>■ oo, jointly and component-wise, for any r > 0 

• ■ • ,f,n,r) 4' (Co,Cl, ■ ■ ■ , ^r) 

where = S~^, and £,j = for j = l,...r. 

(Hi) One may replace ^n,o with Kn,o 

Proof. It suffices to show [(i)] as the arguments for [(ii)] are similar. We first note 
the fact that n~°‘Kn,r 4' We may also consider the {Bj) to be hxed. Us¬ 

ing the notation X 4' Y to mean X/Y —>■ 1 a.s., it follows from ProDOsition l4.ll 
that Kn^r-i 4' Yr, where W = hd Bernoulli((5^)) variables. 

Dividing by Kn^r it follows that W = Kn.rL-, hence n~°‘{Kn^r-iiKn,r) 4' 
n-°^KnA^rA) 4- wffich is Continuing in 

this way it follows that Kn,j-i 4' Kn,r 0^=1 4 for j = 1,..., r. The result is 
concluded by applying the law of large numbers to a vector of Bernoulli sample 
means and utilizing the definition of (S'~g+,,){r>o}- D 

By taking a —^ 0 and known results for PD{{), 6) distributions we obtain the 
following corollary 

Corollary 4.2. Set a = 0 in Proposition Then this yields results for a 
PD(0, 0) nested scheme as follows: 

(i) As n ^ 00 , jointly and component-wise, for any r > 0 

(Kn,o, Kn,i, ■■■, Kn,r) 4' log(n)( 6 », 9 + 1,..., 9-\-r) 

(li) As n —>■ oo, jointly and component-wise, for any r > 0 
(^n.o, 4 . 1 , ■ • ■, 4 .r) 4- log(n)( 6 », 


5. The PD(l/2|t) case 

We now describe all possible limits under any scheme that is asymptotically 
equivalent to our nested schemes based on PKi/ 2 {h- / 1 / 2 ) distributions. This is 
done by looking at their common generator under the PD(l/2|t) model. Note 
again that if is the number of distinct blocks in a partition of [n] following 
a PD(l/2|t) distribution then n~^^^Kn 4' Xj^Jt. See Pitman [i^. Section 8] 
for a nice treatment in regards to the Brownian excursion partition. The simple 
properties of the PD(l/2|t) distribution related to the Markov chain in Per- 
man,Pitman and Yorjdy are also exploited in [1, 13. What we are doing here 
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is expressing properties of this distribution relative to operations correspond¬ 
ing to Proposition 13.11 Recall that <S'i/ 2 ,o = ^ 1/2 = l/(4Gi/2), where G 1/2 is a 
Gamma(l/2,1) variable. This means that 


/l/2(i) 


^ .-3/2 -l/4t 

2 r(i/ 2 ) 


Throughout, let (efc) denote an iid sequence of exponential(l) variables. 

Proposition 5.1. Consider the setting in Provosition \3.1\ then the joint distri¬ 
bution of (T'i/ 2 ,fe){fc>i} | 7 i/ 2 ,o = t is described as follows. 

(i) Setting a = 1/2 in SS.S\) . for each k > I, the conditional density of 
Ti/ 2 ,k\'^i/ 2 ,k-i = tk-i is given by 

/Ti/ 2 ,dTi/ 2 ,.-i(^fel^fc-i) = \tk^e~^e^^ for 4 < tk-i- 

(ii) It follows that (T'i/ 2 ,fc){fe>i} | 7 i/ 2 ,o = t correspond to the points of an in¬ 
homogeneous Poisson point process with intensity 

T-{s\t) = (5.1) 

(Hi) Hence, conditional on Ti/ 2,0 = tt fw each k > 1, one can set 

1 

+ lA 

(iv) From m\) it follows that 2 |Ti/ 2 ,o = t o,re the points of 

an inhomogeneous Poisson point process with intensity 


p{y\t) = (5.2) 

Proof. The results follow by the simplifications unique to the a = 1/2 case via 
the density fi/ 2 {t) as applied to the transition density in (13.31) . □ 

Remark 5.1. Setting Ci = l/v^, Ck+i = 1/y^2H\J^, fork > 1, the spacing 
Ck+i — Ck can be interpreted as the distribution of the edge lengths in Aldous’sl£, 
p.278] construction of the Brownian Random Tree (CRT) when conditioned on 
Cl = 1 /A^. .4s is well known Aldous’s construction corresponds to randomizing 
in the PD(l/2,l/2) case. However, using our explicit descriptions in Proposi- 
tions \5.R such a construction makes sense for any choice oft, yielding different 
trees. 

We next describe some implications of Proposition 15.II 

Proposition 5.2. For a = 1/2, consider the setting in Provosition \3.1\. The 
initial mass partition (Pfc,o)| 2 \/ 2 .o = I follows a PD(l/2|t) distribution. A de¬ 
scription of the corresponding variables is provided as follows. Given, Ti/ 2,0 = 
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(i) = -^461 + \/t and for k > 2, 


T, 


-1/2 
1/2,fe 


. 4y^ + l/t 

\ e=i 


(ii) Vi = l/i/4tei + 1, with density, for 0 < m < 1, 


.... 1 ( 1 -“/^) 
fv^{u\t) = —u ^e 


(5.3) 


(Hi) For k >2, 


Vk = 


\jet + l/t 

\J'^L!1=i + lA 


i^or everz/ r > 0, i/iere exzsi random vectors (^n,o, ^n,i, ■ ■ ■, ^n,r) defined 
under a PD(1/2|<) partition scheme, as in and such that, 

• • ■ ,f,n,r) (Co,Cl> ■■■,&) 


where = l/v^, Ci = + l/t — l/v^, aurf /or k >2, 


(k 


k 




4^ ef + l/< 

i=i 


k-l 




4^ ef + l/t 

fci 


/r/ Results for any PKi/ 2 (^ ■ / 1 / 2 ) distribution are obtained by randomizing t 
with respect to /i(t)/i/ 2(0 which can be any non-negative distribution. 

Remark 5.2. Notice that fn,o not fixed, rather ^j'Jt. 

Remark 5.3. It is now easy to see that the limits for the M and L pref¬ 
erential attachment models in correspond to the case o/PD(l/2,0) and 
PD(l/2,l/2), respectively. 


With respect to the calculations in section 4.2, it follows that 


^ 1 / 2 ,nO + V2) — 4G'„+3/2i3^2+6.2n-f)) “ 4G'(b+3)/2.B((2+f,)/2,(2n-b)/2) 

and 

Yi~/Ub) = + 6 / 2 ) = 5 ( 1 / 2 ,+ b/2). 

From [ 2 I, section 3.1] we have that = (r(n)/i/2(t))~^r(/c)2“^+^/i/2,(n,fe)(t), 

is expressible in terms of a ratio of Meijer G functions as 


/ 

l+fc _k\ 


G°:? 14t 


-n y 

(4t)-^-ie-iit/(-|+n,i,i) 

G?:J| 



(4t)“ie“® 

\ 

V 

—/ 
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where U (a, 6 , c) is the confluent hypergeometric function of the second kind (see 3J, 
p.263]). The above ratio reduces to 

—/r-t-1 1 ——4-—rr f k \ 11 

2 '=+ 1 ^ 2 + 2 [/ _ 

\ 2 2 ’ 2 ’ 4t 


via an application of the recurrence relation p.505] 

U{a, b, z) = + a-b,2-b,z). 

A change of variable t = yields the expression 2 "“''A^“^hfe+i_ 2 „(A) inside 

equation (110) in Pitman where hi,{\) is the Hermite function of index v 34, 
Sec. 10.2], based on the following relationship, 


K[\) = 2’' 


1 ^ 
2 ’ 2 


Hence from Proposition 14.61 it follows that the joint distribution of {Kn,i = 
k, Vi))|Ti/ 2 ,o = can be expressed as. 


*i/2,o(-^n — k) 


2T{k) 


rr,kl 11 

2 2^'"’2’4tn2 


where, as in Pitman 43l| . 

/: 

11*1/2,o(-^n = ^) = ( 


2 n — fc — 1 
n 


2 *+ 


l-2n 


6 . Nesting across a 


Recently, within the context of /c-stable trees, for 1 < /c < 2, Curien and 


Haas [iTj, showed that one can construct all the stable trees simultaneously 


as a nested family. In particular for 1 < k < k' < 2, they relate a K-stable 
tree with a K'-stable tree rescaled by an independent Mittag-Leffler type dis¬ 
tribution. We suspect that hidden in such a story are operations induced by a 
PD(a, —aS) — Frag operator as described in Pitman Theorem 12], for some 
choice of 0 < (5 < 1 and 9 > —aS. Rather than pursue that, we describe how 
one can produce nested PD(ai5,0) partitions from PD (a , 0) nested schemes via 
the corresponding PD(5, -) — COAG operator. See also Corollary 10]. 


Proposition 6.1. Suppose that a sequence of nested local times (S'~e_|_r){r>o} 
correspond to the states of a Markov chain (PD(a,0 -|- r)){,r>o} produced by a 
PD(a,0) nested CRP scheme as described in section 0 Let ((Qfc,r){fc>l}){r>0} 
denote a nested sequence of mass partitions following precisely a nested sequence 
of laws (PD(i5, ^^)){r>o} whose relations will be described more formally be¬ 
low, and where for each fixed r, {Qk,r){k>i} independent of the correspond¬ 
ing PD(a, 0 -|- r) mass partition. Let Pg ttn denote corresponding bridges. A 
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PD(a(5, 0) nested sequence of partitions of [n] with correspond¬ 

ing to the states of a Markov Chain (PI){aS,9 + ?’)){j->o}) can be obtained in a 
distributional sense from the PD(a,0) nested CRP scheme by using the coagula¬ 
tion operation of Pitman encoded in the composition of independent bridges 
PaS,e = Pa, 9 ° Pg s., where Pg ^ , corresponds to the PD((5, — COAG operator. 

More precisely this is encoded in the ordered operation of coagulations described 
by the relations 


p-i 

^a5,e 


= PF„oP-1 = P7^ 






O O ... O A, 


-1 

—1 


oP. 


-1 

a,9-\-r 


where the simple bridge Xa,e{y) := Ai(y), as in ra- A scheme that directly 
relates each pair {Sa, 0 +r-: SaS, 9 +r), and hence (PD(a, 9 + r), PD(a5 ,9 + r)), for 
each step r, can be deduced from the following results. 

(i) There is the identity 


Xa,e O Ps,±i‘) = ° ^aS,9{-) 

(ii) Statement [(i)] means that Xa,g o Pg e_{y) is equivalent to the bridge 


where U[ = Pf~l_{Ui) is the UniformlO, 1] variable associated with the first 

size biased pick from a PYX{5,9/a) distribution, and is further equivalent 
to the bridge representation 


+ (1 - = Pg i^{XaS,9{y)) 

(Hi) The equivalences are then encoded by their local times or S-diversities, 




(iv) The identity PaS,e{-) = Pa,e o Ps,^{-) = -Pa.i+e o Pg i+e o XaS.eO shows 
that the a6-diversity of P^s ,9 can be represented as. 


' 1 —a.S 

Dt6,9 


'f — CtS Q — S 

^,0 


q ;(5 q—5 


-) 


-y—aS 

^aS,l+9-^{i 


A 


(v) Replacing 9 with 9-\-r establishes relationships between (5'“g_|_^){r>o} and 
i^a^Lr){r>o} ■— {‘^a'g+r ^ ^^B+r){r>o} la the respective nested schemes. 
Which is encoded by. 


-P„,A = P.l o P^ \= A^^^g o • • • o A ^ 


ot5.,9 


Q:(5,@+r —1 


oP: 


o p 


-1 

OL,9-\-r 


where = ^5 £±r ° ^a.e+r> the simple bridge XaS, 9 +r-i, 


S 


— CX.5 


-1-5 


l-a.S\ \ = 


a.,9-\-i — 1 *^( 5 ^ e+r-i 

Q—a6 q—5 
‘^a,9-\-r‘^S ^ 
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for r > 1. Note, o ° ^5 1±£> which follows 

from [(i)]. 

(vi) The relationship between the nested sequence of CO AG-operators, follow¬ 
ing the laws (PD(5, ^^)){r>o} encoded by the local time relations 



_ q—6 d—<5 

— i+e -^^e+. 


substituting generally 9 with 0 + r — 1. This follows from[(iv)] by using the 
identity i-ay where the variables on the right are 

not independent. 


Proof. Statement [(i)] is a just a consequence of Pitman’s coagulation operation 
combined with the simpler coagulation operator in 2 l| as follows. Pa^s = Pa,6 o 
Ps^± = Pa,i+e o Xa,e o Ps ±. Also by ^ Pa,i-ee o Ps^i±e = Pa5,i-ee- So for each 
y there is the identity of distribution functions 


Pa5,6{y) — Pa,l+6{Xa,9{Ps,^{y))) — Pa,l+6{Pgl+±{XaS,o{y))), 

which establishes [(i)]. Although equivalent, statement [(ii)] does not directly 
appeal to [(i)], and offers a result by direct construction. By definition, Xa^g o 
Ps,§. (y) is given by 

+ (1 - BG-^,l^))\p-l(u,)<y} 

where U'l = P~]_ {Ui) is, as noted, the point associated with the first size biased 
pick from PD(^, ^), say 1 — Wi following a Beta(l — 5,^-\- 8) distribution Use 
U[ = P~l (Ui), and otherwise replace Pg §_ with its stick-breaking representation 
Pg,<L{y) = WiPg^ A+siy) + (1 “ ^i)\u[<y}- Combining terms, one sees that 
for B = B^e±^ iizs.)) it remains to evaluate the distribution of BWi and (1 — 
Wi)B -\- {1 — B) = 1 — BWi. It follows by the usual Beta Gamma algebra that 
BWi = B^ e+as i-as y The remaining equivalence is apparent either by direct 
evalution or an appeal to Proposition 21]. Statement [(iii)] follows from 
this and the remaining statements have been explained within the text of the 
Proposition. □ 

We close by noting some connections to the Bolthausen-Sznitman coales- 
cent [ 1 ^ . 

Corollary 6.1. Consider the dynamics of the Bolthausen-Sznitman coalescent, 
or U -coalescent, as described in for instance fl^ . Theorem I 4 , Corollary 15] 
or Then setting a = e ^,8 = e ''i and 9 = 0 in Proposition [Kjl 

establishes various relations between the U -coalescent and corresponding nested 
CRP schemes. 

Remark 6.1. This section is partially influenced by recent interactions with 
Anton Wakolbinger and Martin Mohle, whom I thank for their time. One sees 
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that Q'f,t>0) is a version of Mohle’s Mittag Leffter process. Setting 

{Knit) := N^^\t > 0) to match with the notation in denoting for fixed t, 
the number of blocks of a PD(e“*, 0) partition of [n], one has for n —>■ c», 

Knit) N^^\t) a.s. 

as was established in and otherwise corresponds to results for the e“*- 

diversity. 
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